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A Method for Mapping the Active Sites Bound by Exiles that Covaiently Modify 

Substrate Molecules 



Field of Invention 



This invention is immediately relevant to many medical fields including inflammation, 
autoimmunity, transplantation, cancer, anfi-microbiais, virology, metabolic disease and 
allergy. Methods to identify selective substrates of specific enzymes are indicated, based on 
the detailed mapping of the substrate binding site using combinatorial peptide • libraries. 
These enzymes can be any molecule that covaiently modifies its physioloeical subsmre 
target, examples of which include, but are not limited to, protein kinases, crotein 
phosphatases, acerylases and ribosyiases. These derived substrate-based compounds can 
serve as a basis for further medicinal chemistry development of selective enzvme 
inhfoitors. The identification of short pepndic substrates using this methodology will also 
allow for the rapid development of high throughput screens for compound screening 

Background to Invention 

The mapping method is exemplified using members of the protein kinase enzyme familv, 
but this method is applicable to other covaiently modifying enzymes. 

Phosphate transfer (phosphorylation) is the most common form of covaleat orctein 
modification used by cells. Protein kinases are the enzymes that catalyse the transfer of the 
y-phosphate from adenosine . triphosphate (AT?) to an ammo acid residue (usually tyrosine, 
threonine, serine or histidine) on a substrate molecule. Approximately 400 kinases ar- 
currently known, and it is likely that this number will increase considembiv in the next few 



yen* as more information from gene sequencing abases becomes arable 
Functionally, the* mdecute „ fam=Huiar enzymes that pl iy key roies j, c=i1 ^ 
tfcerencanon and inur^a ce-maie** Abena,, protein ^ lctivirv ^ b „ 
unpucated in may disease states including several fonns of cancer and severe-combin-H 
unmunodencicacy disease (Barker « ml 1997; lehtola « m; Amaia » a/ 199a- Eld- e- 
1995; Roiton, 1995). Similarly, activation of protein kinase act,™, within moncnu-- 
cells ,s matured to drive the cytokine production which underlies manv autoimmune 
dtseases (Lee „ al, 1994). Thus, inhibitor compounds capable of specifically inactivate 
certam crmcal leases may have considerable therapeutic benefit in a number of clinical 
diseases. 

All tyrosine and serme'threonme protein kinases have a region of approximated 300 amino 
ac:os known as the catalytic subunit which has evolved irom a common ancestor kinase 
(Hanks and Qumn, 1991). Crystal structure determination of several kmases has shown that 
they ail have a common bi-lobal structure (Wilson e: at, 1996; Zhans „ al, 1994; Xu r a> 
1997). me ammo-terminal part of the subunit encodes a small lobe resoonsibie fir th- 
bmdmg of ATP, whereas the carboxy-terminal residues encode a larser lobe imnortanr for 
protem substrate bmding. In the. tertiary structure of the active kinase, both the AT? and 
the protein substrate binding sues are brought together allowins transfer of the AT- v- 
phosphate to the amino acid acceptor on the protein substrate. The protein/peotide binding 
groove stretches across the- face of the large lobe between two a-helices and under the 
anaUtobe. This groove therefore contains the residues important for denning the subsimte 
specificiry of the kinase. 

Many protem kinases are arranged in kinase CISCadss ^ ^ ^ ^ ^ 

for stgnal amplification in por.-transduction pathways. Tnis ampfification relies or the 
upsmeam kinase specifically activating its downsu-eam parmer. For this reason, orotein 



kinases have developed remarkable sitae specificities which prevent unwanted 
crosstalk between dinerent kinase cascades. Tms substrate spec'ificay can be exploited in' 
the design of selective protein kinase inhibitors 

A technique has recently been developed by L. Cantley's laboratory to provide, consensus 
peptide protein kinase substrate information (WO 95/18823, Songyang er al, 1994). Fir3 . a 
degenerate library of peptides with a phospho-acceptor such as tyrosine or serine/threonine 
flanked by amino acids on each side is synthesised. A preferred number of degenerate 
residues on each side of the phosphorylation site is four (corresponding to positions"-!, -2, - 
3, -4, +1, +2, -3, +4) relative to the phosphoryiated residue. Thus the library consists of 
peptides having a length of nine amino acids. The library is then phosphoryiated by the 
protein kinase of interest and phosphoryiated peptides isolated from the non- 
phosphoryiated peptides by DEAE-sephacel and ferric chelation chrornatogmuhv. The 
phosphopeptide mixture is then sequenced and the frequency of each amino acid at every 
position assessed to give a preferred substrate sequence. These studies have yielded 
consensus substrate information, but do not allow a detailed analysis of particular 
preferences for neighbouring residue interactions as pools of peptides are examined 
Furthermore, - this type of analysis may not show up rare good substrates which^ouid be 
hidden by the presence of numerous poor substrates in the peptide pool. By this method 
individual peptides can never be identified as individual sequences, the result is that an 
average picture of substrate specificity is reached. Pan of the problem is that each 
mtfvidual peptide is represented at such a low level, and many inevitably will not even be 
present. Fhe results from Cantley's method do not represent individual peptides but a 
consensus picture of protein kinase substrate specificity. 

Filamentous phage expressing gene El-linked degenerate peptide sequences have also been 
used to generate substrate information (Schinitz e al, 1996; Dents e: al, 1997), however 



to memoo is labour mrenaive d0K Mt ^ ^ ^ of 

to miotmanon for the design of successful theraoeuuc c-'I n ,m. ' . . . ~ 

inmoitors has tailed (Kemp « a/, 1991). ' w 



We beiieve that idsarincaaon of derailed su^sm- r • • 

L . . su.szrar. c-jacrsnsacs can innmareiv rr.aa *- 

n = ^oove and provme tmcrmanon thai can lead to the assist of . 
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Summary of Invention 



m, „n provides for the acdve m2pptog of TO whi= . ^^.^ 
modrncauon including, but uninsured to ohosphorvlahon, acyiation, denhoanho^lafion in 
wh.cn a axed residue (hereafter Known as the caraiydc restdue, such as a rvrosine sein-' 
tae. hrandine, asparnc acid residue or any other residue containing „ an.roorije' 
stde «■» » modified The meihod of the mvenhou has „ addihonal level of como^rv 
over and above that of the sel^econvolurmg libraries described in WO^flS^ 
content of which is incorporated herein by reference, where legally pemfisaibie). A 

This involves maicmg a horary of smaller libraries (referred ,o as aub-sets) wh=~ a fix- 
reardue la moved stepwrse through the secueuce of ammo acida or order gmupWsuch Z 
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peptidomnnetics [any compound that can be added to the substrate or inhibitor chain]). The 
result is that in each sub-set of the library the fixed residue is found in a different position: 
For example, in a library using four variable positions, five sub-sets in each Iibrnrv have ro 
be made, ZXXXX, XZXXX, XXZXX. XXXZX and XXXXZ where Z is the fixed residue 
and X are the four variable residues. We recognise that there may be a need in certain 
circumstances for further invariant residues, however these would occupy fixed positions 
and would not affect either the scanning or the seif-deconvo luring of the libraries. The 
invariant residue(s) might be fixed in position relative to the modifiable residue 2 or may 
be fixed in position relative to the overall motif sequence. Additional fixed residues can be 
added if desired, or one of the variable residues can be made invariant. In the later case the 
library would be a small pan of the libraries described here. Cases where it is desirable to 
include one or more fixed residues include libraries required to look at enzymes which 
always require another invariant residue in another position. However, in cases where two 
fixed residues are required, and they are both modified, it can be desirable to include this 
residue in one of the variable positions (i.e. make it one of the residues chosen in a variable 
position). The reason for this is that the sequence of events (the order in which the two 
residues become modified) can then also be probed by this scanning horary technique. In 
this case it may also be beneficiaLto make an additional library in which the fixed-esidue is 
not present at all, corresponding to the library XXXX. We would therefore have a library of 
six sub-sets. Tnese modifications are within the scope of the invention and would be 
recognised by someone skilled in the art. 

It can be readily seen that by combining the data from each library sub-set, the residues 
from -4 to +4 either side of the catalytic residue can be mapped: 

A-B-C-D-Z 
B-C-D-Z-E 



C-D-Z-E-? 
D-Z-E-F-G 
Z-E-F-G-H 



The capped sequer.ce would therefore be A-3-C-D-Z-E-F-G-H. 
Tne above examine usin^ 5 suiw* « r - • 

„.„ ■ - ' llDCT:K 01 ps ? ,!dK ° f 5 *=uao acids allows tb- 

mapptu- ox a sequence of 9 ^ ^ „ ^ 0M ^ "f- ,^ 

Wta. ^ reatdue type « any ^ posidon relldv5 „ ^ 
oinereni subsets, the data can be userf i„ . ■ •• 

~- . reqtured adjacen, t0 the fi,ed residue, then any sequences which conta^ 
mn un any of da. horary subsets can be cohered in an additive way. 

In this invention there is no ne-^ tn nmn. j- j - ■ 

„ f ,. , . , ™ se?rae moduieti ™ unmodified sequences be-as- 

of the sett deconvolurinst nature of the in™, -n. Uv 
the M r. - ■ , ~ ^' aS5iy SCrI=n P^ducea a series of hits 

panenas ot watch reveal the unique sequences in each we,,. Tnis enables a oa- ' o 
auosrnrte preferences to be detennmed forany enzyme. 

Tne unique sequences obtained using this invention cart he used to orovide subs-a~s *- 



in the an woulc realise that fay rraiacms the &-» „ 

» *«— r ^sious with a suitaole ccmDcurd that is 
~ » in^ror library „ b e c „. Ktm jJ^ 

sue „ ^ ^ or ^ ^ comuo ; 

Uorary win >, ^ ^ c-Ud ^ dne .ore rf 
prorotype ™ n or enzymes f or ^j^, ^ . 

dii*"?^ couid - be — 4- ,o oihtr «— ^ . 

deerred, „ *e ^ ^ , ^ ^ • 

ST-"- " ' ^ ta «~»« — « «"*-. Phosphor of m im 
xactors in cell lysates. 

in one e^odinaenr of rhi, tovenac * = s „ ^ 

d^;!T" which m " M ^ mor = ™« » «~ «S - ^ 

a Sttmgatiorward onented de»--— at . „_,;„•. 

e. at 1 W T u ■ af,Pr ° aCil UMd b - v Canll =y CSoMvan. 

ul ' ° OVe ' me!h ° d0lOSy ™ * *™ " *e 

^ry o, « baMd Wormrion ^ , M ^ b ^ 

from previously described methods. 

Pais invention allows data to be ohrnm-* • i 

ohvsio, ' ^ 5327,115 mmD,t0r ^"-^ Whi " «™» wi* the 

pnvsrologacal suosrmie for binding t o che acuve si B of the enz- CT = 



Description of the Drawings 

Figure 1. Design of protein tyrosine kinase library. Each peptide consists of a bioun cac an 
epsiion amino hexanoic acid spacer and 5 amino acids, including a . phosphorvlamble 
ryrosine residue. Each of the amino acid positions A-H is varied as described. For example, 
A I - i 0 means that position A is varied using 1 0 denned amino acids. 

Figure 2. Best substrates identified by screening tyrosine library sub-sets 1 to 5 against 
2AP-70 protein tyrosine kinase. The protein ryrosine kinase library described in Figure 1 
was phosphoryiared for 30 minutes at 30°C using the catalytic domain of human ZAP-70. 
Peptides were captured using strepavidin-coated 96 well plates and phospho tyrosine 
detected using ann-phosphotyrosine antibody, and mouse IgG-HRP and 
tetramemyibenridine (see experimental methods). Best substrates were identified as those 
which gave the highest amount of phosphate incorporation. 
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Figure 3. . Km Determination of Biotin- AHA-DEEDYFE(NIe) [SEQ ID NO. 3}. 
catalytic domain of human ZAP-70 was used to phosphorylate varying concentrations of 
peptide for 10 minutes at 30'C in the presence of 33 P- Y -A7P: Peptide capture was 
performed using strepavidin filter plates, scintillation fluid added, and counting performed 
using a beta-counter (see experimental methods). Samoies were assaved in trip lie— 
rtyvrj-i H +o n. Comfort d/6t- r ."tcb'o"i ^ rUc fil+T<s 
Description of the Invention 

Tnis invention provides for the active site mapping of enzymes which catalyse covaient 
modification including, but not limited to, phosphorylation, acylation, dephosphorvlation in 
which a residue such as a tyrosine, serine, threonine, histidine, aspanic acid or any other 
residue containing an appropriate side chain is modified. 



Thus, the invention provides a method for determining an amino acid sequence motif or a 
peptidomimetic sequence mod: containing an active site capable of being bound by an 
enzyme which catalyses covaient modification of a substrate molecule, comprising; 

a ) contacting the szyme with a library consisting of a number of oriented 
degenerate library subsets of molecules, each subset comprising 
unmodified degenerate motif sequences each having n residues and each 
having a modifiable residue at a different fixed non-degenerate nosition, 

. under conditions which allow for modification of molecules which are a 
substrate for the enzyme; 

b ) allowing the ^enzyme to modify modifiable residues in library subsets 
containing molecules having an active substrate site for the enzyme; 

c ) dec onvo luring the oriented degenerate library subsets of the library, in situ 
without separating modified from unmodified molecules, so as to reveal 
the sequence of any motif which has been modified by covaient binding of 
the enzyme; 

wherein each library subset is of formula (I) 

(Xaa), Zaa (Xaa) y (I) [SEQ ID No. 1 ] 

wherein 

Zaa is a non-degenerate modifiable narural or unnatural amino acid residue or 
p ep tidomimetic ; 

Xaa is any natural or unnatural amino acid residue or peptidomimetic; 

x and y are each independently 0 or an integer. 

(x + y)-(n-l);and 

n = an integer from 3 to 8, preferably 5. 
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ud. tnvendon can be ^ for ina.nc- to a protein ^ ^ ta ord=r to 

• - **-= 1 — protein Jcmase sub^-sr- 

sequences which aUows phannac 0o hore ^raor mrf - • 

• a-a«^»aon and assign or acnve site hhibuc'-s This 
invention can also be used to di—riv w„ n -s, • , . — - 

^ea to a,, __n y laenniy protein kinase inhibitor molecule- G . 1 

rormuia: (Xaa), Tyr (Xaa) y [SEQ.ID No. 2J. " 

In the first exemplification of this invention, a recombinant form of the human ,--70 
enzyme was used in an in phosphorylation reaction co phosphoryiate the five ^ 
suD-uoranes which scan the seauenc- -i r Q ^a , 

_ • nc ~ ^"ad ^ central tyrosine residue (F-W- n 

me libraries were arranged in Qn «/»n r-; • , = " 

^ wra^a in 5o w„U microtitre piate rormat with pools of 20 oe-bd-s in 

eacn microtitre well. However those sfcflleH m ,u. •« .- * * * " 

.nose SKiiiea m the an will realise that the Iibrarv c-n fa- 
constructed on anv scale. For examnle rh- iin^r,. c • r , ' *~ 

or e.^ple cfa- horary Suo-Set can be miniaturised on - "chic" 
scale or constructed on a Iant- bulk s-iieH^-,^ i. ~ 

5 DUJC Swaie aepenaing on the requirements of the library. 

Library peptides were made with bio tin taas which allow- 

uu ui_ s , wot, aiiowea pepade canture on streoavidin- 

coatea microtitre oiat^ rw- - 

Of- p.a.s. Dere.aon 01 pncspnotyrosine was achieved tism» ™. 

phospnotyrostne annfaody detecdon b » niti • - - 

subs^e and recording absorbance at 450 nn, Background absorbance reading of 0 1 to 
0 were recorded while the mghesr subsrmte pepbde-ue '.as 1.5. DeconvoUmoc of the 
te pepocea waa perfotmed as described in WO 9%Slf "cTei'aeSed w „ 
deconvoluted in I&nrv sub-sets I to i w „„,•'- t- ~" 

• ' t0 *• out not m >■ Ttas probably reflects the absolve 

reqtnrement of ZAP-70 for an amino acid residue in the -1 posinon. 

For the purpcae of this ettempdncauct, the oepddes used „=re tagged with d-Biodn and . 
tatcer (enailon amino hexanoic actd or some other spacing group). In pnnciole anv „ and 
hntcer eon be used, ahhough thts invendon aiso peovtcea that a tag and Unto does notL- 
oe present tf mass speennaoopv, for ettanapie, is used to idendfv the oeonde ha" To- 



purpose of the tag is solely to enable capture of all of the peptides (whether mooted or 
not) so that excess reagents car, be washed away. The reporting systems to detect ceotid- 
mcdincation can Include, but are not limited to, antibody recognition, radioactive assay or 
mass spectroscopy. 

In the library used to exemplify the invention, a Biotin tag was chosen because we beiiev-d 
that this would give improved results. The reasoning for this ch 01 ce of tag was because of 
the high level of positively charged groups on the enzyme in the area in which the tas sits. 
This charged area would cause unfavourable interactions with tags more commonly use bv 
others in the field, such a poly-lysine or poly-argmme. We would expect this reasoning to 
be applicable to any enzyme which binds a highly negatively charged molecule such as "but 
not limited to ATP, close to the peptide binding site. 

Tags are preferably non peptidic, with as little charge, either positive or nesarive, as 
possible. Biotin is a good example of this. The aim is to rninimise the interactions of the 
tag with the protein so the resultant hits are largely due to the binding of the Denudes rather 
than renecting the bmding of the tag. The best method of ail if this argument is anoiied to 
its logical conclusion would be to not use a tag at all and use mass spectroscopy c^oentirV 
the peptides. However, currently this approach is of limited value due to the time taken to 
run and analyse a library of the size used here to exemplify the invention. 

The results obtained from the horary screen clearly demonstrated amino acid residues 
preferred by the protein kinase at each of the -4 to +4 sites (Figure 2). The 5 mer pentides 
overlapped to give information on amino acid preference at each of the binding positions -4 
to *4. To confirm this a consensus peptide, Bi«iMAHA-DEEDYFE(Nle) [SEQ ID No. 
3], representing the best -4 to +4 amino acids was made and tested as a substrate (Figure 3). 
This substrate gave a Km against ZAP-70 of 15.79 uM, which is better than the best ZAP- 
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70 substrate described in the uterarure, a longer peptide of 14 amino acids with a tas of 3 
argmmes and a K m of 29 uM fwandenburg er c/. 1996). 

In the second application of this invention, a recombinant form of the human Svk enzvne 
was used in an in vivo phosphorylation reaction to phosphor/late the five sate— sub- 
Ubraries which scan the sequence -4 to +4 around a central tyrosine residue, as oreviousiv 
penonned for the ZAP-70 horary. Detection of phosphoryrosine was achieved urine ann- 
phosphoryrosine antibody detection in an ELBA assay using ten^ethviberidine 
substrate and recording absorbance at 450 nm. Background absorbance reading of 0 '0 
were recorded while the highest substrate peptidevaiue was 1.46. Devolution of the hit 
peptides was performed as described in WO S^l^S^nned substrates were 
deconvoluted in ail library sub-sets. A 

In the third application of this invention, a recombinant form of the human CSK enzvme 
was used in an in virro phosphorylation reaction to phospfaoryiate the five substrate sub- 
uomries which scan the sequence -4 to +4 around a central tyrosine residue, as oreviousiv 
performed for the ZAP-70 library. Detection of phosphoryrosine was achieved using and- 

phosphoryrosine antibodv de">"nnn ;-n ^ttca 

• QS — tion in an rUSA assay usmg termmeth^rcerimdine 

substrate and recording absorbance at 450 nm. Background absorbance readies of 0 04 
were recorded while the highest substrate peptitievalue was 0.22. Devolution of the hit 
peptides was performed as described in WO 97/4221 6 .^Cka^delmed substrates were 
deconvoluted in all library sub-sets. ^ 

In the fourth application of this invention, a recombinant form of the Abeison mur- 
leukaemia virus protein tyrosine kmase v-Abl was used in an in vivo ohosnhor.iation 
reacnon to phosphoryiate the library sub-set 4 which scans the sequence -1 to +3 around a 
zero position tyrosine residue, as previously performed for the ZAP-70 iibtarv. Detection of 



phosphotyrosine was achieved using anti-phosnhor/rosine antibody detection in an HLISA 
assay using tesramecfaylb«sdme substrate and recording "absorbance at 450 nm.' 
Background absorbance readings of 0. 1 1 were recorded while the highest substrate peptide 

^SLZg 4~^ e f nV ° iUti0n of ±c ?~ dd - s m Panned as described in WO 
97/422 1 6. Clear denned substrates were deconvolved in the library sub-set. 

In the tilth application of this invention, the invention was used to map the substrate 
specificity of a protein serine or serinc'threonine kinase (which inciude I-kappa B kinase 
beta and cAMP -dependent protein kinase fcAPK]). A protein serine or serme/threonine 
kinase enryme was used in an in virro phosphorylation reaction to phosphoryiace the five 
substrate sub-libraries which scan the sequence -4 to -4 around a central serine residue. 
The library was synthesised as the protein tyrosine kinase ZAP-70 library save that the 
tyrosine fixed residues were replaced with a serine which was then scanned through the five 
sub-libraries. Detection of phosphoserine was achieved using anti-phosphoserine antibody 
detection in an HLISA assay using tetramethylbenzidine substrate and recording absorbance 
at 450 urn. Deconvolution of the hit peptides was oenonned as described in WO 977^221 5* 

General formula: (Xaa), Ser (Xaa) y [SEQ ID No. 4]. 
Library Sub-Set 1 Xaa-Xaa-Xaa-Xaa-Ser 
Library Sub-Set 2 Xaa-Xaa-Xaa-Ser-Xaa 
Library Sub-Set 3 Xaa-Xaa-Ser-Xaa-Xaa 
Library Sub-Set 4 Xaa-Ser-Xaa-Xaa-Xaa 
Library Sub-Set 5 Ser-Xaa-Xaa-Xaa-Xaa 

Likewise the Library Sub-Sets can be synthesised for the mapping of threonine kinases by 
the synthesis of a library containing the threonine residue to allow phosphorylation by 
enzymes recoenisins this residu- 



It wffl be reahsed by those sidiled in the « te the repiaeetnei of the teao^iricn rep- 
aid, as the tyrosine, or serine, with a residue that is covaietttly modified bv"the e-^n- to 
oe mapped aiiows the active site of any such enzyme to be deterged aocordina to the 



invention. 



The invenrlon will now be described by reference to the following examples. 
Example 1 

In this example the invention was used to map the active catalytic site of ZAP-70 a orotein 
kmase enzyme that catalyses the phosphorylation of a tyrosine residue. The examnl- 
ulustrates the synthesis of a number of compounds, and the:r use as a sub-set library for me 
mapping of the enzyme so as to allow the subsequent identification and synthesis of sinele 
specific substrates for the enzyme. 

Synthesis of Peptide Compounds. 

Preparation of Crown Assembly 

The peptide compounds were synthesised in parallel fashion usine Fmoc-Rink-D AMD A 
denvansed gears (ex Chiron Mimotopes, Australia) loaded at approximately 1 6 uM D ~ 
crown. Prior to synthesis each crown was connected to. its respective stem and slotted into 
the 8 x 12 stem holder. Coupling of the ammo acids employed standard Frnoc ammo acid ' 
cnemistry as described m 'Solid Phase Peptide Synthesis:, E. Atherton and R.C. Sheooard 
IRL Press Ltd, Oxford, UK, 1989. 



RemovaJ of N -Fmoc Protection 
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_. -oO «, so,™ ™ ba[h is ^ ^ 200 ^ of a , 0% piper;d . n ^ 

rrr wis — * - «*»« «— * — r M t : 

wasncd consscunvelv with poo ml -acM nvr ,- 

l-uu ml cm), DMF (3 minutes) ^ MeOH „ ; 

mmnw, , naa) ^ kft t0 ^ ^ ftr . . • 
Quantitative UV Measurement of Fmoc Chrotnopbore Release 

A . ctn pad. ^ UV con „ charJed ^ u„,,ra ** pip^^MF sointion and 
osee » :=ro tbe ansorbance of ^ w a a wave!ra2ih Qf 290ot 

-o-d ■ ^ prepared consisting of 5.0 mg Fmoc^OBuO-Pepsv. KA ,0.08 
- ^ mi « a 20% pip = ridin , DlVff 3oMon . ^ ^ ^ Abs;M _ - 
room temperature, An ^ of ^ mul(ipm ^ ^ ^ I 

apnropnate » give a Abs;M . md ^ vijue com?arid ^ 

etc P enmentaHv measured absorbance snowing fce efficiency of ^ ^ 

Cou P Iing Of Standard Amino Acid Residues 

Coupling reactions were performed by ^ ^ ^ ^ 

9o weH piate wtth the P artetn of acavaI=d soMom ^ ^ ^ . 

counltng. Gear (appro, 1.6 utnole) standard couplings were petfood in DMF (300 ul). 

Coupling of an Amino-acid Residue To Appropriate Well 
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*uds: the multium assemb.y i, ^ ^ K . ?moc ^ ^ . d£ . 

T^ 30 *"» *» ° f — ) H6B, (.0 eauivalea.) BmH 

tor tne partite, round of coupling « ^ w , :ghtd ^ ^ 

Atea " Vely> 1PPr0priat = «**"» «*» «■* do ^tata» coloured fron, ft. 

° f "* C70Wn) ' ^ *» «* HBTC (9.9 equivalents cn,=ulated from 

the loadmg of each crown) and , g . H0B , ^ 

loaumg of each crown), nmm {1 „ fcm ^ ^ ^ ^ 

crown) were accurately weighed into suitable containers. 

Toe proceed and activated Fmoc amino acid derivatives were then dissolved in DM? (300 
.for each gear eg. for 20 gauss, 20 x 10 eq. , L6 ^ of derjvadve wouId bt 
m 10 mi DMF). tne appropriate derivatives were then dispensed to the anoronriate wells 
ready ior commencement of the -coupling cycle'. As a standard, couolina reacaous we- 
allowed to proceed for 6 hours. Toe coupled assembly was then washed as detailed below" 

Coupling of d-Biotin acid to pins 

d-Biotin (lOeqj, l-hydro^benzotriaaok^O (lOeoJ, BOP (9.95=,) and NMkf ( , 9 9-a) 
were dissolved in DMF (0.3ml per well) and agitated for 2 tninurea. 300 ul of so.unon 
was dispensed to each wail of a 96-well polypropyiene plate. Toe gears were then added to 
the soiurion and left for 2d hours. Fresh solution was made up, the aeass washed in DMF 
tor a tnmures and then added to the fresh coupling mixrure and left a former 24 hours. 

The put assembiy was removed from the piste, shaken free of excess liouid then hnm-nted 
-DMF (200ml) for amins. Toe assembiy was again shaken then hnmeraed in MeOK 
(_00itj_, 3 x Smins) and allowed to air drv. 



Washing Following Couolin 



If a 20% Dicericine/DMr ds^rnr^- fnn ^ ■ 

• • '-.-^-aon ts to anaeaiately follow the couolin* cvck the 

^ USaBb "' * bnef ' y » —™ -*« cashed co^velj; • 

(200 a, ecch), MeOH (5 nainthea) and « (i ^ _ ^ fi , V™ 

i. - be .ored or reocted ^ to . &u ^ ^ 

taw. sna.<mg men consecutive washes with (200 mi -ach^ D^nr ™ 

v nil -acnj, UMr (o minutes) and M-np- ( '5 

minutes, 5 minutes, 5 minutes) is performed. 

Following these seneral methods rhe nr,nn. i- 

metnods, the pepnae notaries shown in Figure I wers seau-ciailv 

™- oy ^ the appelate coupling at the correct cyC^I 

synthesis. - 

Acidolytic Mediated Cleavage of Peptide-Pin Assembly 

Add abated cleavage protocol were sn-ictly petfood in a ta= hood. A po.vsrvren- 
9o w=U ptate (I n^well, waa Veiled, then the tare wetght measured to the Lose ™ 
Appropriate welht were charged with a difiuoroacchc acid/diiaopropyisHane (95ta, v,V, ^ 
^cienvage solunon, in a panent copending ,o that of the naulhpin aaaetaaoiy to he 

The nauitipin aaaetnh.y „ added, «he etadxe conduct covered in hn foil and left for 2 houra 
Tne nauteptn asaenahly in then added » anocher polyene 96 well ote e (1 „. 
contananag ™ac=dc actd, C aethy.s i ,ane (95:5, v/v, 600 ; a„ (aa ahove) for 5 tninu.es. " ' 

Work up of Cleaved Peptides 
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rne primary polystyrene cleavage plate (2 hour- cleavage) and the secondary poivsryr-a- 
plate (5 minute wash) were then placed in the SpeedVac and the solvents removed 
(minimus: drying rate) for 90 minutes. 

Tne contents of the secondary poiysryrene plate were transferred to their corresponds 
wells on the primary plate using an acetonitrii^water/acetic acid (50:45:5, v/v/v) solution 
(3 x 150 ul) and the spent secondary place discarded. 

Analysis of Products 

A 5uL aliquot from each well is diluted ro 100 ul wich 0.1% aq. TFA, then a lOuL aliouot 
from this plate diluted with a further 100 ul 0.1% aq. TFA. Tne double diluted oiate was 
analysed by HPLC-MS. 

Final Lyophilisarion of Peptides 

Tne plate was covered with tin foil, held to the plate with an elastic band A oin nrick was 
placed in the foil directly above each well and the plate placed at -80°C for 30 minutes The 
plate was then lyophilised on the 'Heto freeze drier' overnight. Finally, the dried olate was 
weighed. Tne total cleaved peptide was quantified (by weight) and the averase content ox- 
each peptide calculated. Since all the peptides present have originated from the same 
peptide-pin assembly, cleaved under identical conditions, it is reasonable to assume that the 
contents of each well are roughly equimolar. 



Protein kinase cloning, expression and purification 
Polymerase chain reaction (PCR) and downstream cloning 
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The coding sequence for human ZAP-70 amino add 306-615 was amplified from JurkatT 
cell cDNA by PCR (2 minutes at 94'C, followed by 35 cycles of 15 seconds 94°C 30 
seconds 65°C, 2 minutes 72°C and a final single 5 minute 72°C incubauon) usine'the 



5' CCGGGATCCGCCATGCCCATGGACACGAGCGTGTAT 3 • [SEQ ID No. 5] 

5' GGGGGATCCTCAGTGGTGGTGGTGGTGGTGGGCACAGGCAGCCTCA.GC 
CTTCTGTGT 3 ' [SEQ ID No. 6] 

The PCR ampiicon was cloned into the Bam HI site of P UC19 and sequence connnn-d 
using MI3-20 and reverse primers on an Applied Biosystems Prism 310 seouenc-r as 
descnbed by manufacturer. The Bam Hi ZAP-70 insert was excised from the secuencin* 
vector and ligated into the Baculo viral transfer vector P AcUW5 1 (Phaimingen). 

Generation of ZAP-70 enzyme using baculovims 

Homologous recombination with wild type baculoviral DNA was then performed in SB 
insect cells and viral supernatant harvested. Plaque purified virus was exposed^ several 
viral amplification steps then used at a titre of 3xl0» PFU/ml to infect 3 1 IxlO 6 cells/ml SO 
cells at an MOI of 10 in an Applecon bioreactor using 6 0 o/ 0 dissolved oxygen. Cells were 
harvested 3 days post infection. 

Protein purification 

The infected cell pellet was Ivsed in 50 mM Tris-HCl, pH 7.5, 0. 15 M NaCI, 25% sucrose 
1 mM ^utrophenol phosphate, 1 mM sodium orthovanadate and protease inhibitors. 



20 



Following homogenisation using a dounce pestle B, the cleared Ivsate was loaded onto a 
cobalt-sepharose column. After column washing with lysis buffer, elution was oerformed 
with an imidazole gradient and ZAP-70 fractions identified by protein kinase activirv 
against the peptide substrate Lys-Lvs-Lys-Lvs-Ala-^-Glu-Glu-.^-Tyr-Phe-ne-Pro-Pro- 
Ala as described in Casnellie et al, 199 1 



Library Screening 

Library peptides were phosphoryiated in pools of 20 peptides at a final concentration of 1 
uM total peptide in 50 mM HEPES. P H 7.5, 0.1% Triton X-100, 100 uM ATP, 10 mM 
MnCl 2 , 1 mM DTT and 0.2 mM sodium orthovanadate for 30 minutes at 30°C. Tnese 
reaction mixtures were then stopped using 100 mM EDTA, 6 mM adenosine, transferred to 
strepavidin-coated microtitre plates and allowed to bind for 30 minutes at 20°C. 
Unincorporated reaction products were washed from the plate using PBS/0.1% Tween 20 
then plate incubation performed with anti-phospho tyrosine antibody (Sisma mous- 
monoclonal clone PT66) in 2% B S A/PBS/Tween for 1 hour at 20°C. Unbound antibody 
was removed by plate washing using PBS/Tween then incubation performed with rabbit 
ami mouse-HRP (Amersham) for a further 1 hour. HRP detection was then performed with 
tetramethylbenzidine (TMB) substrate and measuring absorbance at 450 nM using a 
spectrophotometer (Dynex MRX). 

The. best substrates were identified as those which gave the highest amount of phosphate 
incorporation. The library subsets were deconvolved according to the teaching of 
W097/42216: this gives an immediate determination of the unique sequence of'any 
phosphoryiated motif without the need for further synthesis or sequencing. (Figure o [SEQ 
ID Nos. 7,8,9,10]). 



Peptide K m determination 
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Bio tin-tagged peptides were phosphoryiated at varying concentrations in 50 m_M HEPES, 
pH 7.5, 0.1% Triton X-100, 200 uM ATP, 10 mJVI MnClj, 1 mM DTT and 0.2 miM sodium 
ortbovanadate using 0.5 uCi "P-y-ATP for 10 minutes at 30°C. The reactions were stopped 
using 2 M guanidine hydrochloride, diluted 1 in 10 in water then 5 ul reaction mix spoiled 
onto SAM™ titre plates (Promega). Unincorporated reaction products were washed as 
described by manufacturer then 20 ul scintillation liquid added and plate counted on a 
Packard TopCount beta-counter. Km was calculated using a non-linear one site hyperbola 
model (ATP was added in excess to negate influence of the ATP binding site on the 
substrate site kinetics). (Figure 3). 

Example 2 



In this example the invention was used to map the active catalytic site of Syk, a protein 
kinase enzyme that catalyses the phosphorylation of a tyrosine residue. The example 
illustrates the positional scanning of the sub-set libraries for the mapping of the enzyme, 
and their use so as to allow the subsequent identification of the preferred substrates of the 
enzyme catalytic site. 

The mapping and assessment of the catalytic site was performed as detailed in Example 1. 
The substrate preferences were deconvolved as detailed in WO 97/42216 and are detailed 
below. 



Library Sub-Set 1 
Library Sub-Set 2 



Asp-Glu-Glu-Asp-Tyr [SEQ ID No. 11] 

Asp-Glu-Glu-Tyr-Asp [SEQ ID No. 12] 
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Library Sub-Set 3 Asp-Glu-Tyr-Giu-Asp [SEQ ID No. 13] 

Library Sub-Set 4 Asp-Tyr-GIu-Glu-Val [SEQ ID No. 1 4] 

Library Sub-Set 5 Tyr-Ser-Ile-Ile-Nle [SEQ ID No. 15] 

Example 3 

In this example the invention was used to map the active catalytic site of CSK. The subset 
library was used to scan the enzyme active site so as to allow the subsequent identification 
and synthesis of the preferred specific substrates for the enzyme as listed below. 



Library Sub-Set 1 Asp-Glu-Giu-Glu-Tyr [SEQ ID No. 16] 
Library Sub-Set 2 Asp-Glu-Glu-Tyr-Phe [SEQ ID No. 17] 

Library Sub-Set 3 Asp-Glu-Tyr-Kis-Asn [SEQ ID No. 18] 

Library Sub-Set 4 Asp-Tyr-His-Leu-Phe [SEQ ID No. 19] 

Library Sub-Set 5 Tyr-Pro-Ile-Glu-Val [SEQ ID No. 20] 

Example 4 

In this example the invention was used to map the active catalytic site of v-Abl, a protein 
kinase enzyme that catalyses the phosphorylation of a tyrosine residue. For this enzyme 
only the Library Sub-Set 4 (i.e. X-Tyr-X-X-X according to SEQ ED No. 2) was scanned 
with the enzyme. The active site substrate recognition substrate for this enzyme for this 
Sub-Set was Serine-tyrosine-phenylalanine-histamine-giutamine [SEQ ID No. 21]. 
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